® 



J 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 




© Publication number: 



0 631 444 A1 



® 



EUROPEAN PATENT APPLICATION 
published In accordance with Art. 
158(3) EPC 



@ Application number: 93905614.9 

(g> Date of filing: 03.03.93 

© International application number: 
PCT/JP93/00275 

® Internationa! publication number: 
WO 93/18618 (16.09.93 93/22) 



© Int.CLS: H04N 7/137 



® Priority: 03.03.92 JP 45678/92 
31.08.92 JP 253466/92 
28.12.92 JP 349536/92 

® Date of publication of application: 
28.12.94 Bulletin 94/52 

® Designated Contracting States: 
DE PR GB 

@ Applicant: KABUSHIKI KAISHA TOSHIBA 
72, Horikawa-cho 
Saiwal-ku 
Kawasaki-Shi 
Kanagawa-ken 210 (JP) 

(g) Inventor: UENO, HIdeyuki 
2-8-16, Daita, 
Setagaya-ku 
Tokyo 155 (JP) 

Inventor: KIKUCHI, Yoshihiro, Creare Toshiba 
Yoshinocho 604 
1-10-1, Takasagocho, 



Minami-ku 

Yokohama-shi, 

Kanagawa-ken 232 (JP) 

Inventor: YAMAGUCHI, Noboru^^ 

776, Ohsone 

Yashio-shi, 

Saitama-ken 340 (JP) 

Inventor: ODAKA, Toshlnori 

3-31-12-406, Bessho, 

Minami-ku 

Yokohama-slit, 

Kanagawa-ken 232 (JP) 

Inventor: OKU, Tadahiro 

3-16-46, Fujimi 

Urayasu-shi, 

Chiba-ken 279 (JP) 



© Representative: Freed, Artliur Woolf et al 
MARKS & CLERK, 
57-60 Lincoln's Inn Fields 
London WC2A 3LS (GB) 



00 
CD 



^ 0 TlME-VARYiNG IMAGE ENCODER. 



® Effectively using the ability of prediction by 
means of low-definition local decoded signal for pre- 
dictive coding, the coding efficiency of a time-vary- 
ing image encoder is improved. In a time-varying 
image encoder in which high-definition picture signal 
is subjected to predictive coding and low-definition 
picture signal produced by the conversion of the 
high-definition picture signal is encoded; a low-defi- 
nition picture is created by removing one field from 
one frame through a field skipping circuit (102) (thin- 



ning) and by subjecting the other field to down- 
sampling through a down-sampling circuit (29), low- 
definition prediction signal is produced by subjecting 
the low-definition picture to up-sampling through an 
up-sampling circuit (35), high-definition prediction 
signal corresponding to the removed field is pro- 
duced from the high-definition picture, and prediction 
signal is produced by combining alternately the low- 
and high-definition prediction signals through predic- 
tor (104). 
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Technical Field 

The present invention relates to a motion pic- 
ture coding apparatus, and, more particularly, to a 
motion picture coding apparatus for use in a sys- 
tem, which transmits motion pictures over a line, 
such as a TV conference or a TV telephone, a 
system for storing motion pictures on a storage 
medium, such as an optical disk or a video tape, 
and digital television broadcasting. 

Today, standardization for a scheme for trans- 
mitting motion pictures, which has high quality 
equal to or higher than the quality for the current 
TV system, at a rate of about several Mbps to 
several tens of Mbps has progressed In the com- 
munication field for future establishment of the ser- 
vice in a broad-band network (H. 26X). Likewise, in 
the fields of the storage system and broadcasting 
system, standardization for a coding system which 
has about the same picture quality and bit rate as 
H. 26X has progressed (MPEG2 for the storage 
system and CMTT/2 for the broadcasting system). 
Since it is considered advantageous for populariza- 
tion to establish systems as common as possible in 
those standardizations, the standardizations have 
progressed while mutually exchanging information. 

In the communication system and storage sys- 
tem, there already are H. 261 and MPEGI, which 
are standard systems for motion picture coding 
whose targets are lower picture quality and lower 
bit rate. Specifically, the supply of hardware for 
those standardizations has started. Therefore, one 
of issues is that a new standard system to achieve 
higher picture quality should have mutual connec- 
tability (compatibility) with those existing standard 
systems. There are two types of compatibility: for- 
ward compatibility which permits a decoder of the 
new system to decode a bit stream prepared by an 
encoder of an existing system and backward com- 
patibility which permits a decoder of the existing 
system to decode a part of a bit stream prepared 
by an encoder of the new system. In consideration 
of the fact that the decoder of the existing system 
has already been defined whereas the decoder of 
the new system has been undefined, the backward 
compatibility would be a severe requirement for the 
coding system. 

There may be various systems for ensuring the 
backward compatibility. As one of those methods 
that can define the compatibility as an option, a 
system has been proposed which has a local de- 
coded signal of an existing system included as a 
candidate for a predictive signal using in the new 
system and forms a bit stream In such a way that a 
part of the bit stream by the new system becomes 
a bit stream by the existing system (which is called 
embedded coding using hierarchical coding). 



Chapter 6 of "International Standardization of 
Multimedia Coding" (published by Maruzen) de- 
scribes MPEGI, which is one of the existing sys- 
tems. In that chapter, the section 6.3 "Coding Al- 
5 gorithm" describes an Inter-picture predicting 
structure which consists of an I picture (intra-pre- 
dictron picture), P picture (forward prediction pic- 
ture) and B picture (both forward and backward 
prediction pictures), with reference to Fig. 6.2. In 

10 the description, it Is expected that the new system 
also takes an inter-picture prediction structure simi- 
lar to that of the existing system. While "image = 
picture" is a "frame" in the existing system, such 
as MPEG1. due to a picture signal having a non- 

15 interlaced structure, however, it may become 
"field" having an interlaced structure in the new 
system, making the prediction structure slightly 
complicated. 

In the aforementioned embedded coding using 

20 hierarchical coding, prediction using a local de- 
coded signal by the existing system is added as an 
option, and this prediction Is selected only when it 
is better than the prediction originated from a local 
decoded signal by the new system. In other words. 

25 a better one of a local decoded signal from a local 
decoder of the new system and a local decoded 
signal that is input via an up-sampling circuit from 
a local decoder of the existing system is selectively 
used as a predictive signal in a coding section. It is 

30 therefore considered that the coding efficiency will 
not be deteriorated by the inclusion of the latter 
prediction from the local decoded signal by the 
existing system. Conventionally, however, only the 
prediction error power Is considered in the mode 

35 discrimination for selecting a predictive signal used 
in the coding section and that predictive signal 
which has smaller prediction error power is se- 
lected, so that the prediction using the local de- 
coded signal of the existing system is not suffi- 
ce ciently utilized. That is, while the amount of in- 
formation generated at the time of coding a predic- 
tion error signal is reduced, the amount of Informa- 
tion as the entire coded output including motion 
vector information should not necessarily be re- 

45 duced. This stands in the way of improving the 
coding efficiency. 

For the I picture among individual pictures 
classified by the prediction structure, the prediction 
by the existing system is effective and will be 

50 selected accordingly, whereas for the P picture and 
B picture, the prediction originating from a local 
decoded signal for the coding result by the existing 
system is not so effective and the chance of its 
being selected becomes very low. It is known that 

55 one cause for this depends on the way of prepar- 
ing a picture to be coded by the existing system. 

Further, according to the conventional embed- 
ded coding using the hierarchical coding, in the 
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case where a low-resolution picture is prepared by 
dropping one field off. the field that has not been 
used in preparing the low-resolution picture will not 
be well predicted by a low-resolution predictive 
signal, thus lowering the entire prediction efficien- 
cy. 

As described above, according to the conven- 
tional embedded coding using the hierarchical cod- 
ing, since only the prediction error power is consid- 
ered in the mode discrimination for selecting a 
predictive signal in the coding by the new system, 
the prediction using the local decoded signal of the 
existing system cannot be utilized sufficiently, 
which stands in the way of Improving the coding 
efficiency. 

In addition, according to the conventional em- 
bedded coding using the hierarchical coding, for 
the I picture among individual pictures classified by 
the prediction structure, the prediction by the exist- 
ing system is effective and will be selected accord- 
ingly, whereas for the P picture and B picture, the 
prediction originating from a local decoded signal 
by the existing system is not so effective and the 
chance of its being selected becomes very low. 

It is an object of this invention to provide a 
motion picture coding apparatus, which will over- 
come those conventional problems and can im- 
prove the coding efficiency by effectively utilizing 
the predictive ability using a low-resolution local 
decoded signal in the predictive coding of a high- 
resolution picture signal. 

Disclosure of the Invention 

According to this invention, there is provided a 
motion picture coding apparatus, which performs 
predictive coding on a high-resolution picture sig- 
nal, thins out one field for each frame of a picture, 
down-samples the other field, thereby forming a 
low-resolution picture, and encodes it, character- 
ized in that a low-resolution predictive signal, ob- 
tained by up-sampling a low-resolution local de- 
coded picture and an optimal predictive signal of a 
high-resolution predictive signal are selected for 
each field to form a predictive signal, and is used 
in high-resolution coding. 

Further, according to this invention, a motion 
picture coding apparatus for performing predictive 
coding on a high- resolution picture signal and cod- 
ing a low-resolution picture signal obtained by con- 
verting the high-resolution picture signal, character- 
ized in that as a determination condition at the time 
o1 selecting a predictive signal, used in predictive 
coding of a high-resolution picture signal, from a 
high-resolution predictive signal obtained from a 
high-resolution local decoded signal attained by 
decoding the result of coding the high-resolution 
picture signal, and a low-resolution predictive signal 



obtained from a signal attained by up-sampling a 
low-resolution local decoded signal originating from 
the decoding of the result of coding the low-resolu- 
tion picture signal, prediction error power at the 
5 time the high-resolution predictive signal and the 
low-resolution predictive signal are used in predic- 
tive coding of the high-resolution picture signal and 
the amount of additional information, such as mo- 
tion vector information, are considered. 
10 Furthermore, this invention is characterized in 

that there is a mode for performing predictive cod- 
ing on a high-resolution prediction error signal, 
which is the difference between a prediction high- 
resolution picture signal and a high-resolution pre- 
75 dictive signal, using a low-resolution prediction er- 
ror signal, a high-resolution predictive signal under- 
gone motion compensation is produced using a 
motion vector between a high-resolution local de- 
coded signal and a high-resolution picture signal, a 
20 low-resolution predictive signal, undergone motion 
compensation, is produced from a signal obtained 
by up-sampling a signal based on which motion 
compensation in the coding of the low-resolution 
picture signal is carried out, and the difference 
25 between the low-resolution predictive signal and 
the up-sampled signal of the low-resolution local 
decoded signal is attained to yield the low-resolu- 
tion prediction error signal. 

At the time of preparing a low-resolution pre- 
30 dictive signal, means for shifting a low-resolution 
local decoded signal and means for determining 
the shift amount based on the motion vector of the 
high-resolution picture signal are used. 

Even in the case where whether or not to use 
35 prediction by the low-resolution picture signal is 
switched every field and prediction from the low- 
resolution picture in both fields is not selected due 
to poor prediction from the low-resolution picture 
for the prediction of one field according to the prior 
40 art, the field which provides better prediction from 
the low-resolution picture will be selected, thus 
increasing the entire coding efficiency. Further, the 
introduction of shift can improve the prediction of 
that field which has provided poor prediction, fur- 
45 ther improving the coding efficiency. 

According to this invention, as described 
above, the use of a low-resolution local decoded 
signal by the existing system becomes effective as 
a predictive signal not only for an 1 picture but also 
50 for a P picture and a B picture, thus improving the 
entire coding efficiency. 

Also, in a decision for selecting a high-resolu- 
tion predictive signal and a low-resolution predic- 
tive signal, a predictive signal obtained by using a 
55 low-resolution local decoded signal based on the 
existing system is more effectively selected, by 
taking an amount of additional Information such as 
motion vector information as well as a prediction 



5 



EP 0 631 444 A1 



6 



error power in consideration. 

Tlie coding efficiency can be improved by fur- 
ther performing predictive coding on a high-resolu- 
tion prediction error signal, which has remained 
because sufficient matching has not been done by 5 
the motion compensation of the low-resolution by 
the new system alone, by using prediction error by 
the existing system. 

Furthermore, according to this invention, there 
is provided a motion picture coding apparatus for io 
performing predictive coding on a high-resolution 
picture signal and performing predictive coding of a 
low-resolution picture signal obtained by converting 
the highrresolution picture signal, comprising first 
predicting means for producing a high-resolution 75 
predictive signal based on a high-resolution local 
decoded signal decoded from a signal resulting 
from predictive coding of the high-resolution pic- 
ture signal; up-sampling means for up-sampling a 
low-resolution local decoded signal decoded from a 20 
signal resulting from predictive coding of the low- 
resolution picture: second predicting means for 
producing a low-resolution predictive signal based 
on an output signal of the up-sampling means; and 
third predicting means for producing one of the 25 
high-resolution predictive signal, the low-resolution 
predictive signal and signals obtained by weighting 
and adding the high-resolution predictive signal 
and the low-resolution predictive signal, as a pre- 
dictive signal used when performing predictive cod- 30 
ing on the high-resolution picture signal. 

The first up-sampling means comprises (a) in- 
tra-field interpolation-signal producing means for 
producing an intra-field interpolation signal consist- 
ing only of one field of a signal of the low-resolu- 35 
tion local decoded signal, (b) intra-frame interpola- 
tion-signal producing means for producing an intra- 
frame interpolation signal formed of consecutive 
two fields of signals of the low-resolution local 
decoded signal, and (c) selecting means for select- 4o 
ing, as the output signal of the up-sampling means, 
that one signal among the intra-field interpolation 
signal, the intra-frame interpolation signal and sig- 
nals obtained by weighting and adding the inter- 
polation signals, which is adaptively determined 46 
based on at least one of the high-resolution picture 
signal and other information necessary for coding. 

The second up-sampling means comprises (a) 
first and second motion-compensation up-sampling 
means for performing motion-compensation up- 50 
sampling on consecutive two fields of the low- 
resolution local decoded signal, and (b) motion- 
vector detecting means for detecting a motion vec- 
tor used in the first and second motion-compensa- 
tion up-sampling means, referring to the high-reso- 65 
lution picture signal. 

The third up-sampling means comprises (a) 
first and second motion-compensation up-sampling 



means for performing motion-compensation up- 
sampling on consecutive two fields of the low- 
resolution local decoded signal, and (b) vector 
computing means for computing a motion vector 
used in the first and second motion-compensation 
up-sampling means, based on at least one of a 
motion vector used in predictive coding of the low- 
resolution picture signal and a motion vector used 
in predictive coding of the high-resolution picture 
signal. In this case, the motion vector may be 
obtained by searching around the computed mo- 
tion vector as a reference. 

According to this invention, there is further 
provided weighting-coefficient determining means 
for determining a weighting coefficient used in the 
weighting and adding in the third predicting means, 
using at least one of (a) a motion vector used in 
predictive coding of the low-resolution picture sig- 
nal, (b) a motion vector used In predictive coding of 
the high-resolution picture signal, and (c) a motion 
vector used in motion-compensation up-sampling 
when the up-sampling means performs the motion- 
compensation up-sampling. 

According to the first up-sampling means, at 
the time the low-resolution local decoded signal Is 
up-sampled and a low-resolution predictive signal 
is prepared from the up-sampled signal, adaptive 
up-sampling according to the motion of a picture 
can be performed by selectively using one of either 
the intra-field interpolation signal or intra-frame in- 
terpolation signal of the low-resolution local de- 
coded signal, and a signal obtained by weighting 
and adding those signals, which is adaptively de- 
termined based on at least one of the high-resolu- 
tion picture signal and the other information neces- 
sary for coding. 

This prevents the efficiency from deteriorating 
by the turn-around In a motionless area when only 
the intra-field up-sampling is used, or from deterio- 
rating by time-different signal components entering 
when only the intra-frame up-sampling is used, 
thus ensuring efficient prediction in a dynamic area 
as well as a motionless area. In this case, if the 
weight for weighting and adding the intra-field inter- 
polation signal and the intra-frame interpolation sig- 
nal is adaptively determined using a high-resolution 
picture signal as a reference picture, the optimum 
weighting coefficient for predicting a high-resolution 
picture signal will be determined, thus further im- 
proving the coding efficiency. 

According to the second up-sampling means, 
at the time the up-sampling of a low-resolution 
local decoded signal is subjected to motion-com- 
pensation up-sampling, the motion vector for mo- 
tion-compensation up-sampling is determined with 
a high-resolution picture signal as a reference pic- 
ture signal, the optimal motion vector for predictive 
coding of the high-resolution picture signal will be 
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determined, thus improving the coding efficiency. 

Further, according to the third up-sampling 
means, at the time the up-sampling ot a low- 
resolution local decoded signal is subjected to mo- 
tion-compensation up-sampling, it is not particularly 5 
necessary to send the motion vector necessary to 
the motion-compensation up-sampling by perform- 
ing computation using the motion vector that is 
used in the predictive coding of the low-resolution 
picture signal or the predictive coding of the high- io 
resolution picture signal. In the case where search 
is conducted around the computed motion vector 
taken as a reference to obtain a motion vector, the 
searching range for the motion vector can be nar- 
rowed as compared with the case where the mo- 75 
tiop vector for motion-compensation up-sampling Is 
directly obtained and sent out. Therefore, the 
amount of motion vector information as well as 
hardware will be reduced. 

20 

Brief Description of Drawings 

Fig. 1 is a block diagram of a motion picture 
coding apparatus according to a first embodi- 
ment of this invention; 25 
Fig. 2 is a diagram showing the relationship 
between to-be-coded pictures of various sizes in 
the embodiment in Fig. 1 ; 

Fig. 3 presents diagrams showing selectable 
candidates for low-resolution prediction in the 30 
embodiment in Fig. 1 ; 

Fig. 4 is a diagram showing the structure of a 
predictor and prediction-mode decision circuit in 
the embodiment in Fig. 1 ; 

Fig. 5 is a diagram showing another structure of 36 
the predictor and prediction-mode decision cir- 
cuit in the embodiment in Fig. 1; 
Fig. 6 is a diagram showing how the amount of 
shift of low resolution is computed in the em- 
bodiment in Fig. 1; 40 
Fig. 7 is a block diagram of a motion picture 
coding apparatus according to a second em- 
bodiment of this invention; 
Fig. 8 is a diagram showing how to produce a 
low-resolution picture signal by dropping off one 45 
field, and an advantage of performing motion 
compensation on a low-resolution local decoded 
signal; 

Fig. 9 is a diagram showing a decision function 
at the time of selecting a low-resolution predic- so 
five signal and a high-resolution predictive signal 
in a prediction-mode decision circuit in Fig. 7; 
Fig. 10 is a block diagram showing an example 
of the structure of the prediction-mode decision 
circuit in Fig. 7; 55 
Fig. 11 is a diagram showing another method of 
producing a signal for predicting one field which 
has been dropped off when a low-resolution 



picture signal has been prepared by dropping 
off one field; 

Fig. 12 is a diagram tor explaining a method of 
using both of a low-resolution picture signal and 
a high-resolution picture signal in prediction; 
Fig. 13 is a block diagram of a motion picture 
coding apparatus according to a third embodi- 
ment of this invention; 

Fig. 14 is a diagram for explaining a predictive 
coding operation for a high-resolution picture 
signal in the embodiment in Fig. 13; 
Fig. 15 is a block diagram of a motion picture 
coding apparatus according to a fourth embodi- 
ment of this invention; 

Fig. 16 is a diagram showing a first example of 
procedures for down-sampling a high-resolution 
Input signal to a low-resolution signal in a down- 
sampling circuit in the motion picture coding 
apparatus in Fig. 1 5; 

Fig. 17 is a diagram showing a second example 
of procedures for down-sampling a high-resolu- 
• tion input signal to a low-resolution signal in a 
down-sampling circuit in the motion picture cod- 
ing apparatus in Fig. 15; 

Fig. 18 is a diagram showing procedures for up- 
sampling a low-resolution signal to a high-reso- 
lution signal in an up-sampling circuit in the 
motion picture coding apparatus in Fig, 15; 
Fig. 19 is a block diagram showing a first exam- 
ple of a vertical up-sampling circuit portion of 
the up-sampling circuit in the motion picture 
coding apparatus in Fig. 15; 
Fig. 20 Is a block diagram showing a second 
example of the vertical up-sampling circuit por- 
tion of the up-sampling circuit in the motion 
picture coding apparatus in Fig. 15; 
Fig. 21 is a block diagram showing a third 
example of the vertical up-sampling circuit por- 
tion of the up-sampling circuit in the motion 
picture coding apparatus in Fig. 15; 
Figs. 22A and 22B are diagrams for explaining 
the operation of the vertical up-sampling circuit 
portion in Fig. 20; 

Figs. 23A and 238 are diagrams for explaining 
the operation of the vertical up-sampling circuit 
portion in Fig. 21 ; 

Fig. 24 is a block diagram exemplifying a mo- 
tion picture coding apparatus having a scalabil- 
ity; and 

Fig. 25 is a diagram showing the relation be- 
tween line positions of a high-resolution picture 
and a low-resolution picture. 

Best Mode of Carrying Out the Invention 

Embodiment of the present invention will now 
be described referring to the accompanying draw- 
ings. 
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Referring to Fig. 1, a description will be given 
of an embodiment which inhibits prediction for 
even fields using a low-resolution local decoded 
signal, but performs it for odd fields. 

This embodiment will be described with refer- 
ence to the case where a picture to be coded by a 
new system is an interlaced picture, a picture to be 
coded by the existing system is a non-interlaced 
picture prepared by dropping one field from the 
interlaced picture, and in the coding by the new 
system, an interlaced picture is coded with frame 
multiplexing. 

In Fig. 1, a terminal 10 where an input picture 
signal is to be input is connected to a dividing 
circuit 101 via a field merge circuit 100. The field 
merge circuit 100 performs field merging of a field 
picture to treat the field picture of an input picture 
signal as a frame picture, and the dividing circuit 
101 is provided to divide the frame picture into a 
plurality of blocks. The output of the dividing circuit 
101 is connected to one input terminal of a sub- 
tracter 12 and a predictor and prediction-mode 
decision unit 104. The other input terminal of the 
subtracter 12 is connected to the output terminal of 
the predictor and prediction-mode decision unit 
104. The output terminal of the subtracter 12 is 
connected to the input terminal of a discrete cosine 
transform (DCT) circuit 17. The output terminal of 
the DCT circuit 17 is connected to the input termi- 
nal of a variable-length coder 19 via a quantizer 18 
which quantizes DCT coefficient data from the DCT 
circuit 17. The output terminal of the variable- 
length coder 19 is connected to the input terminal 
of a buffer memory 20. This buffer memory 20 is 
connected to tlie quantizer and an output terminal 
21. 

The output terminal of the quantizer 18 is con- 
nected to one input terminal of an adder 24 via an 
inverse quantizer 22 and an inverse discrete cosine 
transform (inverse DCT) circuit 23. The adder 24 
adds inverse DCT data and a predictive signal 
together to produce a high-resolution local decoded 
signal. The other input terminal of the adder 24 is 
connected to the output terminal of the predictor 
and prediction-mode decision unit 104, and the 
output terminal of the adder 24 Is connected to a 
prediction circuit 134 (Fig. 4) of the predictor and 
prediction-mode decision unit 104 via a frame 
memory 27 for storing a high-resolution local de- 
coded signal. A prediction-mode decision unit 135 
(Fig. 4) of the predictor and prediction-mode de- 
cision unit 104 has its prediction-mode output ter- 
minal and a motion-vector output terminal con- 
nected to the variable-length coder 1 9. 

The input terminal 10 is connected to the input 
terminal of a coding section 30 of the existing 
system via a field skip circuit 102, a down-sam- 
pling circuit 29 and a dividing circuit 103 in a serial 



manner. The output terminal of the coding section 
30 is connected to an output terminal 32 via a 
buffer memory 31 and to the input terminal of a 
local decoder 33 of the existing system. The output 

6 terminal of the local decoder 33 is connected to a 
low-resolution prediction selector 132 (Fig. 4) of the 
predictor and prediction-mode decision unit 104 via 
an up-sampling circuit 35. 

The operation of the coding apparatus with the 

10 above structure will be described referring to Figs. 
2. 3A and 3B. 

Fig. 2 shows the relationship between to-be- 
coded pictures of various sizes. Figs. 3A and 3B 
show selectable candidates for low-resolution pre- 

76 diction in this embodiment. 

A field picture signal of an input picture signal 
input from the input terminal 10 is subjected to 
field merging in the field merge circuit 100 to be 
treated as a signal of a frame picture as shown in 

20 Fig, 2. The frame picture signal from the field 
merge circuit 100 is divided into a plurality of 
picture blocks by the dividing circuit 101 for cod- 
ing. Image data subjected to block segmentation 
after merged to a frame picture is such data that 

26 data of an odd field (Odd field) and data of an even 
field (Even field) appear alternately line by line as 
indicated on the left end in Figs. 3A and 3B. In the 
drawings, data of 4 x 4 blocks is shown for de- 
scriptive simplification, and o indicates data of an 

30 odd field and A indicates data of an even field. 

When a block of input data (input picture sig- 
nal) is Input to the predictor and prediction-mode 
decision unit 104, signals for various types of inter- 
frame prediction (by referring a picture in the frame 

35 memory 27) from a high-resolution picture, intra- 
frame prediction, or prediction from a low-resolu- 
tion picture (by referring a picture obtained by up- 
sampling a picture in the frame memory in the 
local decoder 33 of the existing system by the up- 

40 sampling circuit 35) considered as prediction can- 
didates are produced, and differences between 
these signals and the input signal are obtained. A 
prediction mode to minimize the difference is se- 
lected based on a certain evaluation criterion. For 

45 the prediction from the high-resolution picture 
whose detailed description will be omitted because 
it is not concerned with the subject matter of this 
invention, but, for example, a system currently con- 
sidered in MPEG2. 

50 In this invention, besides the system of produc- 

ing a prediction picture from a low-resolution pic- 
ture together with an odd line and an even line as 
shown in Fig. 3A, there is a system of producing a 
prediction picture from a low-resolution picture for 

55 odd lines and a prediction picture from a high- 
resolution picture for even lines and the reverse 
combination as shown in Fig. 38 (in the diagram, • 
indicates a low-resolution predictive signal). To pro- 
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Gi -.H those signals, the predictor and prediction- 
mocJQ decision unit 104 separates the input signal 
to odd lines and even lines, and obtains an optimal 
signal (double triangle in Fig. 3B) from all available 
signals as a high-resolution predictive signal for 
each line. Then, pixels at a position corresponding 
to the current coding block in the frame memory in 
the local decoder 33 of the existing system are 
read out, and up-sampled by the up-sampling cir- 
cuit 35. Finally, those pixels are alternately merged 
line by line to be a predictive signal. A predictive 
signal only from a low-resolution picture is pro- 
duced by up-sampling a picture in the frame mem- 
ory of the existing system. 

Further, matching is separately performed on 
the pixels o and A of the input picture while shifting 
pixels in the picture obtained by up-sampling a 
low-resolution picture to find the optimal amounts 
of shift, A signal merged line by line alternately 
with those pixels and a signal merged line by line 
with the predictive signal of a high-resolution pic- 
ture are produced in a simitar manner. 

In either case, three candidates are produced 
for a low-resolution predictive signal ((1) odd: low 
resolution, even: high resolution; (2) odd: high reso- 
lution, even: low resolution: (3) low resolution for 
both odd and even). The predictor and prediction- 
mode decision unit 104 selects a candidate, which 
optimizes the prediction error, from among those 
three candidates from the low-resolution predictive 
signal and candidates from the high-resolution pre- 
dictive signal, gives the optimal candidate as a 
predictive signal to the subtracter 12 and sends a 
prediction mode to the variable-length coder 19. 
The prediction mode is subjected to variable-length 
coding as type information and multiplexed for 
each prediction unit (e.g.. a macroblock for MPEG). 
In this embodiment, different type information are 
assigned to all the candidates for the predictive 
signal. 

Another embodiment is to indicate by 1-bit flag 
whether or not prediction from a low-resolution 
picture is to be included. 

Fig. 4 is a block diagram showing an example 
of the structure of the predictor and prediction- 
mode decision unit 104 in Fig. 1, and its operation 
will be described in association with the description 
of Figs. 3A and SB. First, an input block is sepa- 
rated to lines of pixels o and A by an even/odd 
numbered line separator 130, an optimal predictive 
signal (double triangle) is obtained from a high- 
resolution picture for each field by a high-resolution 
prediction selector 131. and a motion vector cor- 
responding to the optimal predictive signal is sent 
to prediction-mode decision unit 135. Then, an 
optimul predictive signal • is obtained from a low- 
resolution picture for each field by the tow-resolu- 
tion prediction selector 132, and a motion vector 



corresponding to the optimal predictive signal is 
sent, when necessary, to prediction-mode decision 
unit 135. 

An even/odd numbered line merge circuit 133 

5 executes three types of merging from a combina- 
tion of the above, yielding a predictive signal cor- 
responding to Figs. 3A and 38, and this signal is 
sent to the prediction-mode decision unit 135. The 
high-resolution prediction circuit 134 prepares a 

10 high-resolution predictive signal other than the sent 
predictive signal (e.g., a signal corresponding to 
prediction that is currently under consideration in 
MPEG2), and it is sent together with a correspond- 
ing motion vector to the prediction-mode decision 

75 unit 135. The prediction-mode decision unit 135 
selects a predictive signal, which minimizes the 
prediction error, from among all the received pre- 
dictive signals , and sends it to the subtracter 12. 
The prediction-mode decision unit 135 also selects 

20 a motion vector, which minimizes the prediction 
error, from among all the received motion vectors, 
and sends it to the variable-length coder 1 9. 

The optimal shift amount detected for a low- 
resolution picture may be subjected as motion vec- 

25 tor information to variable-length coding in the vari- 
able-length coder 19 as separate from the motion 
vector information of the high-resolution picture, 
and may be multiplexed before transmission. As 
there is a low possibility of selecting a mode for 

30 predicting only that field dropped out by the field 
skipping from a low-resolution signal, this mode 
may be eliminated from selective targets in this 
embodiment. Further, the shifting may be restricted 
to the prediction of only the field dropped out by 

35 the field skipping. 

Another embodiment of this invention will be 
described below. 

Fig. 5 is a diagram showing the structure of the 
predictor and prediction-mode decision circuit in 

40 this embodiment; same reference numerals as 
used In Fig. 4 are used for same portions. 

First, candidates for a high-resolution predictive 
signal are produced by the high-resolution predic- 
tion circuit 134 as in the embodiment shown in Fig. 

4S 4. They include the detection of forward and back- 
ward motion vectors. An input is separated into 
even and odd lines by the even/odd line separator 
130, and an optimal high-resolution picture in the 
case where prediction is performed with a high- 

50 resolution picture for each field is selected by the 
high-resolution prediction selector 131. This em- 
bodiment differs from the above-described embodi- 
ment in the way of producing a low-resolution 
predictive signal, which is selected by the low- 

55 resolution prediction selector 132 and used for 
prediction of that field dropped out by the field 
skipping. Motion vector detection is performed and 
a low-resolution predictive signal is prepared using 
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motion compensation in the above-described em- 
bodiment, whereas a low-resolution predictive sig- 
nal is prepared using a motion vector which is 
derived by scaling the motion vector obtained for a 
candidate for high-resolution prediction down to 
that corresponding to one field time interval. This 
will be described referring to Fig. 6. Using a motion 
vector given as a candidate for high-resolution pre- 
diction for the prediction of an even field, for exam- 
ple, a vector V1 in the diagram (indicated by a 
broken line in the diagram), the vector of a motion 
vector VI • for low-resolution prediction (indicated 
by a thick line in the diagram) will be obtained. 
When an even field to be predicted is apart from 
that field which is used for prediction by n fields, 
as illustrated, the motion vector is given by 

vr = Vl/n (1) 

Since this motion vector is directly used for 
low-resolution prediction in this embodiment, it is 
unnecessary to send the motion vector for low- 
resolution prediction. 

Here, search Is conducted in the determined 
search area around this motion vector to obtain an 
optimal motion vector again for low-resolution pre- 
diction. In this example, if the optimal motion vec- 
tor is a vector V1", the difference between this 
vector VI " and the reference vector VI ' 

AV = VI " - vr (2) 

is sent as a difference vector. 

The candidate for the predictive signal pre- 
pared on the low-resolution side and the candidate 
for the predictive signal on the high-resolution side 
are averaged by an averaging circuit 140, and are 
merged by the even/odd line merge circuit 133 to 
be candidates for a low-resolution predictive signal. 
From among those candidates and the candidates 
from the high-resolution predictive signal, a predic- 
tive signal candidate which minimizes the predic- 
tion error is selected by the prediction-mode de- 
cision unit 135. 

According to this embodiment, as the motion 
vector of a low-resolution picture is prepared from 
the motion vector for a high-resolution picture, both 
should correspond to each other. In other words, 
the vector VV is used on the low-resolution side 
when a candidate for which the vector VI, for 
example, in Fig. 6 is used is selected, and the 
vector V2' is used on the low-resolution side when 
a candidate for which the vector V2 is used is 
selected. There are three ways of prediction on the 
high-resolution side: forward direction, backward 
direction and bidirectional. In the bidirectional case, 
a motion vector on the field side closer to a to-be- 
predicted field has only to be used. 



Fig. 7 is a block diagram of a motion picture 
coding apparatus according to the second embodi- 
ment of this invention. This embodiment will be 
described with reference to the case where a pic- 
5 ture to be coded by the new system is an inter- 
laced picture, a picture to be coded by the existing 
system is a non-interlaced picture prepared by 
dropping one field from the interlaced picture. 

In Fig. 7, a high-resolution picture signal is 
10 input as an input picture signal 11 to a terminal 10. 
This input picture signal 1 1 is input to a subtracter 
12 where the difference between this signal and a 
high-resolution predictive signal 13 is obtained, 
yielding a high-resolution prediction error signal 14. 
75 A first selector switch 15 selects one of the input 
picture signal 11, the high-resolution prediction er- 
ror signal 14 and a low-resolution predictive signal 
16 under the control of a prediction-mode decision 
circuit 37. 

20 The signal selected by the selector switch 15 

is subjected to DCT coding in a DCT (Discrete 
Cosine Transform) circuit 17. DCT coefficient data 
obtained by the DCT circuit 17 is quantized by an 
quantizer 18. The signal quantized by the quantizer 
25 18 is sent to two ways; one is subjected to vari- 
able-length coding in a variable-length coder 19 
and is then sent via a buffer 20 to a transmission 
system or a storage system at a predetermined bit 
rate from an output terminal 21. The subtracter 12, 
30 DCT circuit 17, quantizer 18 and variable-length 
coder 19 constitute the first coding means. 

The other one of the signal quantized by the 
quantizer 18 and branched to two ways is sequen- 
tially subjected to opposite processes to those of 
35 the quantizer 18 and DCT circuit 17 in an inverse 
quantizer 22 and an inverse DCT circuit 23, and is 
then added to a signal selected by a second selec- 
tor switch 25 in an adder 24, yielding a high- 
resolution local decoded signal 26. The inverse 
40 quantizer 22, inverse DCT circuit 23 and adder 24 
constitute the first local decoding means. 

The selector switch 25 is controlled, interlocked 
with the selector switch 1 5, by the prediction-mode 
decision circuit 37, and selects "0" when the selec- 
ts tor switch 15 selects the input picture signal 11, the 
high-resolution predictive signal 13 when the switch 
15 selects the high-resolution prediction error sig- 
nal 14, and selects the low-resolution predictive 
signal 16 when the switch 15 selects the low- 
so resolution predictive signal 16. 

The high-resolution local decoded signal 26 
output from the adder 24 is written in a frame 
memory 27 having a memory capacity for a plural- 
ity of frames. The output of the frame memory 27 
55 is input to a predictor 28 and is used to prepare 
the high-resolution predictive signal 13. The predic- 
tor 28 detects a motion vector between the input 
picture signal 1 1 . which is a high-resolution picture 
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signal, and the high-resolution local decoded signal 
fronn the frame memory 27, and perfornns motion 
compensation inter-frame (inter-field) prediction us- 
ing this motion vector to produce the high-resolu- 
tion predictive signal 13 and outputs motion vector 
information 38. The predictive signal thus obtained 
includes three ways of prediction, namely the for- 
ward prediction, the backward prediction and the 
bidirectional prediction, as described in the afore- 
mentioned document, and optimal prediction 
among those will be selected. 

The input picture signal 11 is also input to a 
down-sampling circuit 29. which is means for con- 
verting a high-resolution picture signal to a low- 
resolution picture signal, and is down-sampled 
there to be converted into a low-resolution picture 
signal of a format, such as GIF or SIF. An example 
of the process in the down-sampling circuit 29 is 
down-sampling by dropping one field off as shown 
in Fig. 8A. A low-resolution picture signal indicated 
by a small rectangle is prepared by dropping off an 
even field of a high-resolution picture signal in- 
dicated by an elongated, large rectangle and sub- 
sampling an odd field in the horizontal direction 
after putting it through a low pass filter for remov- 
ing horizontal turn-around. 

The low-resolution picture signal of the GIF or 
SIF format obtained by the down-sampling circuit 
29 is coded by a coding section 30 based on the 
existing system (e.g., H. 261 or MPEGl), which is 
the second coding means, and is then sent via a 
buffer 31 to the transmission system or storage 
system at a predetermined bit rate from an output 
terminal 32. The coded output from the coding 
section 30 is subjected to local decoding by a local 
decoder 33 based on the existing system (H261 or 
MPEGl), which is the second local decoding 
means. A low-resolution local decoded signal 34 
output from the local decoder 33 is input to the 
coding section 30 to prepare a predictive signal 
which is used in predictive coding in the coding 
section 30, and is up-sampled by an up-sampling 
circuit 35 to be also used to prepare the low- 
resolution predictive signal 16 in a second predic- 
tor 36. 

The predictor 36 detects a motion vector be- 
tween the input picture signal 11, which is a high- 
resolution picture signal, and a signal obtained by 
up-sampling a low-resolution local decoded signal 
in the up-sampling circuit 35, and performs motion 
compensation on the up-sampled signal of the low- 
resolution local decoded signal using this motion 
vector to produce the low-resolution predictive sig- 
nal 16, and outputs motion vector information 39. 

Referring to Fig. SB, a description will now be 
given of the advantage of performing motion com- 
pensation on the low-resolution local decoded sig- 
nal in the predictor 36. If the low-resolution picture 



signal prepared by down-sampling the picture sig- 
nal as shown in Fig. 8A is put through the coding 
section 30 and the local decoder 33 to produce the 
low-resolution local decoded signal 34 and this 

5 signal 34 is up-sampled in the up-sampling circuit 
35, when used to predict an odd field, it ensures 
good prediction because It positionally matches 
with the original odd field. When it is used to 
predict an even field, however, good prediction is 

10 not possible for an area where there is motion due 
to a mutual time shift. If the up-sampled signal of 
the low-resolution local decoded signal is shifted 
by a shift amount corresponding to the motion in 
the field period as shown in Figs. 8A and 8B. 

75 however, the signal positionally matches with an 
even field, making it possible to perform accurate 
prediction using the shifted low-resolution local de- 
coded signal. In this case, therefore, the motion 
compensation by the predictor 36 basically need to 

20 be performed only at the time of predicting an 
even field. As the execution of motion compensa- 
tion reduces a prediction error in an odd field, 
however, the motion compensation may be per- 
formed with respect to both fields. 

25 The prediction-mode decision circuit 37 deter- 

mines a prediction mode at the time of coding the 
input picture signal 11. and controls the selector 
switches 1 5 and 25 in accordance with the decision 
result. The prediction-mode decision circuit 37 out- 

30 puts a prediction mode signal 40 that indicates 
which prediction mode has been selected. The 
prediction mode signal 40 is input to the variable- 
length coder 19 to be subjected to variable-length 
coding together with the quantized signal from the 

35 quantizer 18 and motion vector information 38 and 
39 output from the predictors 28 and 36. 

Fig. 9 shows a function for determining which 
one is to be selected in the prediction-mode de- 
cision circuit 37, the high-resolution predictive sig- 

40 nal 13 from the predictor 28 or the low-resolution 
predictive signal 16 from the predictor 36. With the 
high-resolution predictive signal 1 3 in use, informa- 
tion of one or two motion vectors should be sent as 
the motion vector information 38. With the low- 

45 resolution predictive signal 16 in use, however, 
information of zero or one motion vector (one or 
two in the case where motion compensation from a 
low-resolution picture signal is to be also per- 
formed with respect to an odd field) has only to be 

50 sent as the motion vector information 39. 

As mentioned above, the motion vector in- 
formation 38 and 39 are transmitted after being 
subjected to variable-length coding in the variable- 
length coder 19. The numbers of bits (the bit 

55 amounts of variable-length codes) used in sending 
those motion vector information 38 and 39 is es- 
timated and the difference therebetween is denoted 
by MVbitdiii- Now, let us consider requantization of 



10 



17 



EP 0 631 444 A1 



18 



the prediction error signal of the low-resolution 
picture signal with this number of bits. 

Given that MSEpredi indicates the prediction 
error power of the predictive signal 14 when the 
high-resolution predictive signal 13 from the predic- 5 
tor 28 is used and MSEpreda indicates the predic- 
tion error power of the predictive signal 14 when 
the low-resolution predictive signal 16 from the 
predictor 36 is used, MSEpredi and MSEpred2 x 
2-2MVbridifi jg compared. When the result of the io 
comparison decision is 



MSEpredl < MSEpred2 x 2 



-2MVbhdifl 



(3). 



76 



the high-resolution predictive signal 13 Is selected 
by the selector switches 15 and 25, and when 

MSEpredl > MSEp,ed2 X 2-2MVbltdiH (4)^ 

the low-resolution predictive signal 16 is selected 20 
by the selector switches 15 and 25. Then, the 
optimal predictive coding of the Input picture signal 
1 1 becomes possible while suppressing the 
amount of information to be sent. This Is particu- 
larly effective in the case where it is unnecessary 25 
to send motion vector with respect to the low- 
resolution predictive signal 16, e.g., when the low- 
resolution predictive signal 16 is prepared from a 
frame formed with merged fields. 

The decision function a in Fig. 9 may be a 30 
function where a = 2" ^MVbitdifi^ ^^at is, a function 
whose inclination is changed by MVbhdiff- This func- 
tion can likewise used to compare predictive sig- 
nals of a high-resolution picture with each other. 

Fig. 10 Is a block diagram showing a specific 35 
example of the structure of the prediction-mode 
decision circuit 37 shown in Fig. 7 based on the 
above-described principle. The input picture signal 
(high-resolution picture signal) 11 and the high- 
resolution predictive signals 13 from the predictor 40 
28 are input to a high-resolution prediction-mode 
decision circuit 41 where one of the high-resolution 
predictive signals 13 is selected. The difference 
between the selected high-resolution predictive sig- 
nal and the input picture signal 11 is obtained by a 45 
subtracter 44, yielding a high-resolution prediction 
error signal. The low-resolution predictive signal 16 
from the predictor 36 is input to a subtracter 45, 
which obtains the difference between this signal 
and the input picture signal 11, yielding a low- 50 
resolution prediction error signal. The motion vector 
information 38 and 39 from the predictors 28 and 
36 are input to motion-vector Information amount 
estimating circuits 42 and 43 to estimate the 
amounts of motion-vector information or the 55 
amounts of generated information from the vari- 
able-length coder 19 which correspond to the mo- 
tion vector 38 and 39. A subtracter 46 obtains the 



difference between the estimated amounts of mo- 
tion-vector Information or the aforementioned 
MVbitdiff. 

An operation/decision circuit 47 obtains predic- 
tion error powers MSEp^edi and MSEpred2 of the 
high-resolution predictive signal and low-resolution 
predictive signal, and performs decision given by 
the equations (3) and (4) from the difference 
MVbiitdiii between the two amounts of motion-vector 
information, using the decision function shown in 
Fig. 9. In accordance with the decision result, the 
circuit 47 controls the selector switches 15 and 25 
in Fig. 7 and outputs the prediction mode signal 
40. 

According to the embodiment in Fig. 7. as 
described above, under the decision criterion 
where additional information, such as motion-vector 
information, as well as the prediction error power is 
considered with respect to both the high-resolution 
prediction error signal and the low-resolution pre- 
diction error signal, the predictive signal to be used 
In predictive coding of the high-resolution input 
picture signal is selected, thus advantageously im- 
proving the coding efficiency. 

The above embodiment may be worked out in 
various modifications as follows. 

(1) Although a low-resolution picture signal is 
prepared from an odd field, preceding by time, 
by dropping off one field in Fig. 8A, it may be 
prepared from an even field. Accordingly, with 
regard to the prediction of an odd field, the 
prediction from the low-resolution local decoded 
signal is backward prediction, so that further 
improvement on the prediction efficiency will be 
expected. 

(2) The low-resolution predictive signal for the 
prediction of an even field may also be obtained 
by spatial-temporal interpolation as shown in 
Fig. 11, not by motion compensation prediction. 
More specifically, the Individual pixels of a low- 
resolution prediction new word of an even field 
are interpolated with a signal of those pixels of 
an odd field which are adjacent along the time 
axis t of the low-resolution local decoded signal 
and also along the spatial axis (vertical direc- 
tion) V. In this case, it Is unnecessary to transmit 
motion-vector information corresponding to the 
low-resolution picture signal, thus Improving the 
coding efficiency. 

(3) In the case of performing motion compensa- 
tion on an even field too. the motion vector in 
use can be estimated from a previous motion 
vector or the motion vector used for coding a 
low-resolution picture. In this case too, it is un- 
necessary to transmit motion-vector information 
corresponding to the low-resolution picture sig- 
nal. 
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(4) A function to produce a predictive signal 
using both the high-resolution local decoded sig- 
nal and the low-resolution local decoded signal 
may be added to the predictor 28. In this case, 
as shown in Fig. 12, the high-resolution signal 
(high-resolution local decoded signal) and the 
low-resolution signal (low-resolution local de- 
coded signal) are filtered by a high-pass filter 1 
and a low-pass filter 2 whose pass-bands have a 
mutually complementary relation, and are then 
added to form a predictive signal. The low- 
frequency component of the thus obtained pre- 
dictive signal is produced from a low-resolution 
predictive signal closer in terms of time, and the 
high-frequency component is produced from the 
high-resolution picture, so that the improvement 
on the prediction efficiency is expected. Further, 
the addition of the two local decoded signals will 
reduce quantized noise. 

(5) At the time of detecting a motion vector 
between the input picture signal 11 (high-resolu- 
tion picture signal) and the high-resolution local 
decoded signal in the predictor 28, the amount 
of computation can be reduced by limiting the 
search range to around that motion vector which 
is the motion vector between the low-resolution 
picture signals, obtained by the coding section 
30, enlarged to a motion vector corresponding 
to the high-resolution picture. 

A motion picture coding apparatus according to 
the third embodiment of this invention will now be 
described referring to Fig. 13. In this embodiment, 
four selector switches 51 . 54. 55 and 56 for selec- 
tion of a prediction mode are provided, and those 
switches are controlled by a prediction-mode de- 
cision circuit 37. The prediction-mode decision cir- 
cuit 37 may have the structure as shown in Fig. 10. 
or may have the same structure as the prior art. 
The selector switch 51 selects either an input pic- 
ture signal 11 or a high-resolution low-resolution 14 
output from a subtracter 12. The selector switch 54 
selects either a signal 52 selected by the selector 
switch 51 or a low-resolution prediction error signal 
from a subtracter 53 (a difference signal between 
the signal 52 and a low-resolution predictive signal 
16). The signal selected by this selector switch 54 
is input to a DCT circuit 17. 

The selector switch 55 is controlled, interlocked 
with the selector switch 51. by the prediction-mode 
decision circuit 37, and selects "0" when the selec- 
tor switch 51 selects the input picture signal 11. 
and the high-resolution predictive signal 13 when 
the switch 51 selects the high-resolution prediction 
error signal 14. Likewise, the selector switch 56 is 
controlled, interlocked with the selector switch 54, 
by the predictiorr-rnode decision circuit 37, and 
selects "0" when the selector switch 54 selects the 
signal from the selector switch 51. and the low- 



resolution predictive signal 16 when the switch 54 
selects the low-resolution prediction error signal. 

The signal selected by the selector switch 55 
is added to a signal from an inverse DCT circuit 23 

5 in an adder 24a, and the signal selected by the 
selector switch 56 is added to a signal output from 
the adder 24a in an adder 24b, thus producing a 
local decoded signal 26. 

As in the embodiment of Fig. 7, a predictor 28 

10 detects a motion vector between the input picture 
signal 11. which is a high-resolution picture signal, 
and the high-resolution local decoded signal from a 
frame memory 27, and performs motion compen- 
sation inter-frame (inter-field) prediction using this 

76 motion vector to produce the high-resolution pre- 
dictive signal 13 and outputs motion vector in- 
formation 38. The predictive signal thus obtained 
includes three ways of prediction, namely the for- 
ward prediction, the backward prediction and the 

20 bidirectional prediction, and optimal prediction 
among those will be selected. 

The motion vector information 38 obtained by 
the predictor 28 is sent to a predictor 36 as well as 
to a variable-length coder 19. The predictor 36 

25 accesses a frame memory 57. which stores plural 
frames of low-resolution local decoded signals from 
a local decoder 33, performs motion compensation 
on a signal Input via an up-sampling circuit 35 from 
the frame memory 57 based on the motion vector 

30 information sent from the predictor 28, using the 
same method as the predictive signal preparing 
method selected by the predictor 28. and outputs 
the motion-compensated low-resolution predictive 
signal 16. 

35 Fig. 14 is a diagram showing how predictive 

coding of a high-resolution picture signal is carried 
out In this embodiment. The predictor 28 outputs 
the high-resolution predictive signal 13. obtained 
by performing motion compensation on a high- 

40 resolution picture signal (high-resolution local de- 
coded signal) at t = t1, and the subtracter 12 
outputs the high-resolution prediction error signal 
14. which is the difference between the high-reso- 
lution predictive signal 13 and the high-resolution 

45 picture signal 11 at t = tO. The predictor 36 
outputs a low-resolution prediction error signal, 
which is the difference between a low-resolution 
predictive signal, which is obtained by performing 
motion compensation on a low-resolution picture 

60 signal (low-resolution local decoded signal) at t = 
t1 , and a low-resolution local decoded signal at t = 
to, as the low-resolution predictive signal 16. This 
low-resolution predictive signal 16 is supplied to 
the subtracter 53 as a predictive signal tor inter- 

55 frame (or inter-field) predictive coding on the high- 
resolution prediction error signal 1 4. 

According to the embodiment in Fig. 13, as 
described above, there is a mode for further per- 
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forming predictive coding of the high-resolution 
prediction error signal 14 using the low-resolution 
prediction error signal as a predictive signal, so 
that the local decoded signal by the existing sys- 
tem can be used more effectively in the predictive s 
coding of the high-resolution picture signal, thus 
improving the coding efficiency, which is an advan- 
tage. 

According to the above embodiment, it is pos- 
sible to effectively use the prediction ability using a io 
low-resolution local decoded signal in the predic- 
tive coding of a high-resolution picture signal in the 
motion picture coding apparatus of an embedded 
coding system using hierarchical coding, thus im- 
provising the coding efficiency. 76 

Now. a motion picture coding apparatus ac- 
cording to a fourth embodiment of this invention 
will be described referring to Fig. 15. 

In Fig. 15, a high-resolution picture signal, e.g.. 
an HDTV signal, is input as an input picture signal 20 
11 to a terminal 10. This input high-resolution pic- 
ture signal 1 1 is input to a subtracter 12 where the 
difference between this signal and a predictive 
signal 13 is obtained, yielding a prediction error 
signal 14. 25 

A first selector switch 15 selects either the 
input high-resolution picture signal 11 or the pre- 
diction error signal 14. The signal selected by the 
selector switch 15 is subjected to DCT in a DCT 
(Discrete Cosine Transform) circuit 17. DCT coeffi- 30 
cient data obtained by the DCT circuit 17 is quan- 
tized by an quantizer 18. The signal quantized by 
the quantizer 1 8 is sent to two ways; one is sub- 
jected to variable-length coding in a variable-length 
coder 19 and is then sent via a buffer 20 to a 35 
transmission system or a storage system (not 
shown) at a predetermined bit rate from an output 
terminal 21. 

The subtracter 12, selector switch 15, DCT 
circuit 17, quantizer 18 and variable-length coder 40 
19 constitute a high-resolution picture coding cir- 
cuit based on a motion-compensation adaptive pre- 
dictive coding system. 

The other one of the signal quantized by the 
quantizer 18 and branched to two ways is sequen- 45 
tially subjected to opposite processes to those of 
the quantizer 18 and DCT circuit 17 In an inverse 
quantizer 22 and an inverse DCT circuit 23, and is 
then added to the predictive signal 13 in an adder 
24. The second selector switch 25 is controlled, 50 
interlocked with the selector switch 1 5, and selects 
the output of the inverse DCT circuit 23 when the 
first selector switch 15 selects the input high-reso- 
lution picture signal 11, and the output of the adder 
24 when the first selector switch 15 selects the 55 
prediction error signal 14, thereby producing a 
high-resolution local decoded signal 26. 



The inverse quantizer 22, inverse DCT circuit 
23, adder 24 and second selector switch 25 con- 
stitute a high-resolution local decoding circuit. 

The high-resolution local decoded signal 26 is 
written in a frame memory 27. The output of the 
frame memory 27 is input to a first predictor 28 
and is used to prepare a high-resolution predictive 
signal 40. The first predictor 28 detects a motion 
vector between the input high-resolution picture 
signal 11, which is a high-resolution picture signal, 
and the high-resolution local decoded signal from 
the frame memory 27, and performs motion com- 
pensation inter-frame (or inter-field) prediction us- 
ing this motion vector to produce the high-resolu- 
tion predictive signal 40 and outputs motion vector 
information 38. 

The prediction system in the first predictor 28 
may be a method for adaptively switching between 
inter-frame prediction and inter-field prediction, or 
may be a system as described in the document 
"One Method of Adaptive Motion Compensation 
prediction for Interlaced Image" by Otaka, 
Yamakage and Yamaguchi, Picture Coding Sympo- 
sium 1992 (PCSJ92), 5-13. 

The input high-resolution picture signal 11 is 
also input to a down-sampling circuit 29. which is 
means for converting a high-resolution picture sig- 
nal to a low-resolution picture signal, and is down- 
sampled there to be converted into a low-resolution 
picture signal of a format defined by, for example, 
CCIR. Rec. 601. This low-resolution picture signal 
is a signal which, like the high-resolution picture 
signal, has been subjected to interlace scanning. 
The vertical pixel positions of the high-resolution 
picture signal and low-resolution picture signal 
have a relationship as shown in Fig. 25. 

The down-sampling circuit 29 performs pro- 
cessing shown in Fig. 16 or Fig. 17, for example. 
Fig. 16 illustrates a process of executing down- 
sampling in a field. The high-resolution picture sig- 
nal is down-sampled in the horizontal direction and 
vertical direction every field, yielding a low-resolu- 
tion picture signal. As shown in Fig. 25, the rela- 
tionship between the vertical pixel positions of the 
high-resolution picture signal and low-resolution 
picture signal differs between odd fields and even 
fields, so that the vertical down-sampling to be 
executed for odd fields would differ from that for 
even fields. 

That is, down-sampling for odd fields is per- 
formed by executing filtering of odd taps using 
filter coefficients for odd fields, such as -29. 0, 88, 
138, 88, 0, and -29/256, whereas down-sampling 
for even fields is performed by executing filtering 
of even taps using filter coefficients for even fields, 
such as -4. 23, 109, 109. 23, and -4/256. 

Fig. 17 illustrates a process in a system of 
executing down-sampling after an interlaced picture 
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is converted to a non-interlaced picture. In the non- 
interlace conversion, a process or time spatial pro- 
cess adapted to motion is performed, as described 
in. for example, a document (1): C.L Lee, S. 
Chang, C.W. Jen. "Motion Detection and Motion 
Adaptive Pro-scan Conversion", 1991 IEEE Interna- 
tional Symposium on Circuits and Systems. Vol. 1, 
pp. 666-9. (June 1991), a document (2): A. Nguyen, 
Eric Dubois, "Spatio-temporal Adaptive Interlaced- 
to-progresslve Conversion", International Workshop 
on HDTV '92, and a document (3): V.G. Devereux, 
"Standards Conversion between 1250/50 and 
625/50 TV Systems", IBC 1992, pp. 51-5, (July, 
1992). 

Returning to Fig. 15, a low-resolution picture 
signal 42 obtained by the down-sampling circuit 29 
is coded by a low-resolution picture coding circuit 
of a motion-compensation adaptive predictive cod- 
ing system, which comprises a subtracter 43. a 
selector switch 44, a OCT circuit 45, a quantizer 46 
and a variable-length coder 47. The output of this 
low-resolution picture coding circuit is sent via a 
buffer 31 to a transmission system or a storage 
system from an output terminal 32 at a predeter- 
mined bit rate. The output of the quantizer 46 is 
subjected to local decoding by a low-resolution 
local decoding circuit, which comprises an inverse 
quantizer 48, an inverse DCT circuit 49, an adder 
50 and a selector switch 51 . The output of this low- 
resolution local decoding circuit (the output of the 
selector switch 51), 56, is sent to two ways; one is 
written in a frame memory 52. The output of the 
frame memory 52 is input to a predictor 53 and is 
used to prepare a predictive signal 55 for coding a 
low-resolution picture signal. 

Since the operations of the above-described 
low-resolution picture coding circuit, low-resolution 
local decoding circuit, buffer 31, frame memory 52 
and predictor 53 are the same as those of the 
aforementioned high-resolution picture coding cir- 
cuit, high-resolution local decoding circuit, buffer 
20, frame memory 27 and predictor 28, except for 
the difference in resolution of signals to be treated, 
their detailed descriptions will be omitted. 

The other part of the branched output 56 of the 
low-resolution local decoding circuit is input to an 
up-sampling circuit 60 to be up-sampled, yielding 
an up-sampling picture signal 65 which is a high- 
resolution picture signal. This up-sampling picture 
signal 65 is used to prepare a low-resolution pre- 
dictive signal 62 in a second predictor 61, 

A third predictor 63 produces a predictive sig- 
nal 13 based on the high-resolution predictive sig- 
nal 40 and the low-resolution predictive signal 62. 
The third predictor 63 may perform its processing 
to select either the high-resolution predictive signal 
40 or the low-resolution predictive signal 62 as the 
predictive signal 13, or may produce a signal, 



obtained by weighting and adding those signals, as 
the predictive signal 13. The selection between the 
high-resolution predictive signal 40 and the low- 
resolution predictive signal 62, or the determination 

5 of the weighting coefficient has only to be per- 
formed by selecting the one which makes the 
square errors of the predictive signal 13 and the 
input high-resolution picture signal 11. The third 
predictor 63 outputs information, which indicates 

10 which signal, the high-resolution predictive signal 
40 or the low-resolution predictive signal 62, has 
been selected as the predictive signal 13, or in- 
formation 64 indicating the weighting coefficient 
used in prediction. This information is multiplexed 

75 with the output of the high-resolution picture coding 
circuit in the variable-length coder 19 before being 
sent out from an output terminal 21. 

The up-sampling circuit 60, which character- 
izes this invention, will be described in detail be- 

20 low. Fig. 18 is for explaining the process of the up- 
sampling circuit 60. As illustrated, the low-resolu- 
tion local decoded signal 56, which is the output of 
the low-resolution local decoding circuit, is up-sam- 
pled to be doubled in the horizontal direction, and 

25 is then subjected to vertical up-sampling, thus 
yielding the up-sampling picture signal 65. which is 
a high-resolution picture signal. 

Rg. 19 is a block diagram showing a vertical 
up-sampling circuit portion of the up-sampling cir- 

30 cuit 60, which performs vertical up-sampling in Fig. 
18. In this vertical up-sampling circuit portion, an 
intra-field up-sampling signal and an intra-frame 
up-sampling signal are adaptively weighted and 
added. 

35 In other words, a signal 401 obtained by hori- 

zontal up-sampling of the low-resolution local de- 
coded signal 56, which is input from a horizontal 
up-sampling circuit portion In the up-sampling cir- 
cuit 60, which performs horizontal up-sampling in 

40 Fig. 18. is separated Into an odd-field signal 403 
and an even-field 404 in a first field separator 402. 
The odd-field signal 403 and even-field signal 404 
are subjected to vertical Interpolation in an odd- 
numbered field intra-field interpolation circuit 405 

45 and an even-numbered field intra-field interpolation 
circuit 406, respectively, using the signals in the 
individual fields, thus producing an odd-field intra- 
field interpolation signal 407 and even-field Intra- 
field interpolation signal 408. 

50 The odd-field intra-field interpolation circuit 405 

and the even-field intra-field interpolation circuit 
406 perform processes to prepare a signal marked 
by "X" from a signal marked by "o" and interpo- 
late a signal marked by "A" with a signal marked 

55 by in Fig. 25, and, more specifically, perform 
the computations to execute linear interpolation of, 
for example, upper and lower lines in accordance 
with the distance, which are expressed by the 
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following equations. 















fse(r2) 


= (3-fe(i) + fe(i-1))/4 




fse(i*2 + 


1) = (3fe(i) + fe(i + l))/4 




where 






fo : 


odd-field signal 403. 




fe : 


even-field signal 404, 




fso: 


odd-field intra-field interpolation 


signal 




407. 




fse: 


even-field intra-field interpolation 


signal 




408. and 




i : 


line number 





An odd-numbered field intra-frame interpolation 
circuit 447 and an even-numbered field intra-frame 
interpolation circuit 448 perform field-overlapped 
vertical interpolation using signals of two fields to 
produce an odd-field intra-frame interpolation signal 
409 and an even-field intra-frame interpolation sig- 
nal 410. This process is to interpolate a signal 
marked by "X" and a signal marked by "A" using 
both of a signal marked by "o" and a signal 
marked by and performs computations ex- 
pressed by, for example, the following equations. 

fto(i*2) = fo{i) 
fto(i*2 + 1) = fe(i) 
fte(i*2) = (fo(i) + fe(i))/2 
fte(i*2-H) = (fe(i) -i- fo(i + 1))/2. 

The odd (even)-fleld intra-field interpolation sig- 
nal 407 (408) and the odd (even)-field Intra-frame 
interpolation signal 409 (410) are respectively mul- 
tiplied by weighting coefficients Wo(We) and 1-Wo- 
(1-We) in multipliers 416 (417) and 418 (419). and 
are then added by an adder 420 (421). yielding an 
odd (even)-field up-sampling picture signal 422 
(423). 

The weighting coefficients Wo and We (0 ^ 
Wo, We ^ 1) are determined with the input high- 
resolution picture signal 11 taken as a reference 
signal. More specifically, the input high-resolution 
picture signal 11 is separated, by a second field 
separator 411. into a high-resolution odd-field sig- 
nal 412 and a high-resolution even-field signal 413, 
which are in turn input to an odd-numbered field 
weighting decision circuit 414 and an even-num- 
bered field weighting decision circuit 415. respec- 
tively. For instance, of several candidates for the 
weighting coefficients, those which minimize the 
square mean values or the sums of the absolute 
values of the differences between the up-sampling 
picture signals 422 and 423 and the input signals 
412 and 413, have only to be selected. Alter- 
natively, coding may be tried to select that weigh- 
ting coefficient which minimizes the amount of 



codes. Selective information 430 and 431 of the 
weighting coefficients are sent as side information 
440. 

The candidates for the weighting coefficients 

5 Wo and We may only be 0 and 1. In this case, 
either the intra-field up-sampling or the intra-frame 
up-sampling is used, so that the multipliers 416 to 
419 and the adders 420 and 421 are unnecessary, 
and instead, a switch should be provided which 

10 selects either the intra-frame up-sampling signal or 
the intra-field up-sampling signal for each field in 
accordance with motion decision information. 

The weighting coefficients may be shared with 
the other selective information necessary for cod- 

76 ing. thus eliminating the need for the side informa- 
tion. For instance, in the case of the coding system 
for switching between frame DCT and field DCT for 
each given area of a picture, the weighting coeffi- 
cients may be determined based on this switching 

20 information. Conversely, the weighting coefficients 
may be determined in the previously described 
manner, and whether to use the frame DCT or field 
DCT may be determined based on the coefficients. 
In the case of the system of switching prediction 

25 methods, such as motion compensation, for each 
given area of a picture, the weighting coefficients 
may be determined based on this switching in- 
formation or the motion vector. 

Further, the weighting coefficients may be de- 

30 termined based on the already-encoded informa- 
tion, such as the coding mode, the motion vector 
the size of an inter-frame difference of a local 
decoded picture for an already-coded area around 
an area to be coded, the coding mode, the motion 

35 vector and the size of an inter-frame difference of a 
local decoded picture for an already-coded frame, 
and the local decoded picture, thereby reducing 
the amount of the weighting coefficient information. 
Fig. 20 is a block diagram showing the second 

40 example of the vertical up-sampling circuit portion, 
of the up-sampling circuit 60. In Fig. 20, first and 
second field separators 502 and 511 perform the 
same processes as the first and second field sepa- 
rators 402 and 411 in Fig. 19. 

45 An odd-numbered field motion-compensation 

up-sampling circuit 505 and even-numbered field 
motion-compensation up-sampling circuit 506 per- 
form up-sampling motion-compensated from an 
odd-field signal 503 and an even-field signal 504 

50 based on motion vectors 509 and 510 detected by 
first and second motion-vector detectors 507 and 
508. yielding an odd-field up-sampling picture sig- 
nal 422 and an even-field up-sampling picture sig- 
nal 423, respectively. 

55 The operations of the first and second motion- 

vector detectors 507 and 508 and the processes in 
the up-sampling circuits 505 and 506 in Fig. 20 will 
be described with reference to Figs. 22A and 22B. 
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The symbols "o," "X" and " "zii" In Rgs. 22A 
and 22B mean the same as those in Fig. 25. 

The first motion-vector detector 507 detects 
the motion vector from the even-field signal 504 to 
a high-resolution odd-field signal 512. With regard 
to that signal of the high-resolution odd-field sig- 
nals which is spatially located at the same position 
as the low-resolution odd-field signal (signal 
marked by "X" overlapping the mark "o" in Figs. 
22A and 22B). the motion vector will not be de- 
tected and the low-resolution signal at the same 
position at the up-sampling time is used directly. 
With regard to that signal which is not spatially 
located at the same position as the low-resolution 
odd-field signal (signal marked by "X" and not 
overlapping the mark "o" in Figs. 22A and 22B), 
the motion vector from the low-resolution even-field 
signal 504 (signal marked by "□") with the square 
mean value or the sum of the absolute values of 
the differences, etc.. taken as an evaluation refer- 
ence for every given unit area, or in such a way as 
to minimize the amount of codes originating from 
some temporary coding, and a signal obtained by 
subjecting the low-resolution even-field signal to 
motion compensation based on this motion vector. 

The second motion-vector detector 507 detects 
the motion vector from the odd-field signal 503 to a 
high-resolution even-field signal 513. At this time, 
either one of Figs. 22A and 228 is fixed, or motion 
compensation which provides a smaller square er- 
ror with, for example, the input high-resolution pic- 
ture signal 11 is selected, based on the motion 
vector 509 obtained by the first motion-vector de- 
tector 508. 

In Fig. 22A, a signal 652. which is obtained by 
subjecting a low-resolution odd-field signal to mo- 
tion compensation using a vector 651, attained by 
reversing the motion vector 509 from the low- 
resolution even field to the high-resolution odd field 
In both the horizontal and vertical directions and 
shifting it downward by +1/2, is used as an up- 
sampling signal for odd lines, and a signal 654, 
which is obtained by subjecting a low-resolution 
odd-field signal to motion compensation using a 
vector 653, attained by reversing the motion vector 
509 in both the horizontal and vertical directions is 
used directly or an interpolation signal with a low- 
resolution even-field signal 655 thereabove by one 
line is used as an up-sampling signal for even 
lines. 

In Fig. 228, a signal 662. which is obtained by 
subjecting a low-resolution odd-field signal to mo- 
tion compensation using a vector 661, attained by 
reversing the motion vector 509 in both the hori- 
zontal and vertical directions and shifting it upward 
by +1/2, is used as an up-sampling signal for even 
lines, and a signal 664, which is obtained by sub- 
jecting a low-resolution odd-field signal to motion 



compensation using a vector 663, attained by re- 
versing the motion vector 509 in both the horizontal 
and vertical directions is used directly or an Inter- 
polation signal with a low-resolution even-field sig- 

5 nal 665 thereunder by one line is used as an up- 
sampling signal for odd lines. 

Information 520 indicating which one of the 
motion vector 509 and Figs. 22A and 228 has been 
selected is sent as side information 440. 

10 The motion vector may be detected by con- 

ducting a search around a vector obtained by com- 
pensating the motion vector of a low-resolution 
picture or a high-resolution picture in accordance 
with the field time. Accordingly, even if the search 

76 range for the motion vector is made narrower than 
the case of the direct searching, nearly the same 
advantage will be obtained. In the case where the 
motion vector used in the low-resolution picture 
coding is compensated before its usage, this mo- 

20 tion vector information can also be obtained on the 
decoder side, so that the motion vectors obtained 
by the motion-vector detectors 507 and 508 and 
information about the difference between those mo- 
tion vectors may be transmitted. This will reduce 

25 the amount of the motion vector information. 

Fig. 21 is a block diagram showing the third 
example of the vertical up-sampling circuit portion 
of the up-sampling circuit 60. In Fig. 21. a field 
separator 602 performs the same processing as 

30 the field separator 402 in Fig. 19. 

An odd-numbered field motion-compensation 
up-sampling circuit 605 and an even-numbered 
field motion-compensation up-sampling circuit 606 
perform up-sampling of an odd-field signal 603 and 

35 an even-field signal 604, which has undergone mo- 
tion compensation based on the motion vectors 
609 and 610 computed by a motion-vector comput- 
ing circuit 607, thus yielding an odd-field up-sam- 
pling picture signal 422 and an even-field up-sam- 

40 pling picture signal 423. 

The motion-vector computing circuit 607 com- 
putes the motion vector which is obtained by com- 
pensating the motion vector 608, used in the pre- 
diction of the low-resolution signal coding, in accor- 
ds dance with the field distance. As the motion vector 
from the low-resolution odd-field signal to the high- 
resolution even-field signal, vectors 762 and 763 
attained by rounding a vector 761, obtained by 
compensating the low-resolution motion vector in 

50 accordance with the field distance as shown in, for 
example, Fig. 23 A, for each line of the low-resolu- 
tion, or a vector 764 obtained by performing the 
rounding for every 1/2 line, odd-field signal, may 
be used, and when the motion vector is rounded 

66 every 1/2 line, the signal of a non-integer line may 
be prepared by interpolation of the mentioned line. 

As the motion vector from the low-resolution 
even-field signal to the high-resoiution odd-field 
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signal, a vector 752 attained by rounding a vector 
751, obtained by connpensating the low-resolution 
motion vector in accordance with the field distance 
as shown in, for example, Fig. 23B. for each line of 
the low-resolution, or a vector 753 obtained by 5 
performing the rounding for every 1/2 line, odd- 
field signal, may be used. For odd fields, If there Is 
a line located at spatially the same position in the 
low-resolution picture odd-field, the signal of that 
line is used, and motion compensation will be iq 
performed only on those lines which are not lo- 
cated at spatially the same positions, 

A signal motion-compensated from the oppo- 
site field to the low-resolution local decoded signal 
may be used directly as an up-sampling signal for 76 
both the odd and even fields, or a signal weighted 
with the low-resolution local decoded signal of the 
same field may be used as an up-sampling signal. 
In this case, the weighting coefficient may be fixed 
or may be determined based on the magnitude of 20 
the motion vector, or the size of. the vector dropped 
off when the motion vector Is rounded. 

Since it is possible to compute the motion 
vector on the decoder side in the same procedures 
as the motion-vector computing circuit 607, it is 25 
unnecessary to send the motion vector. 

In the previous embodiments, when a signal 
obtained by weighting and adding the high-resolu- 
tion predictive signal 40 and the low-resolution pre- 
dictive signal 62 is to be added to a candidate for 30 
prediction, the weighting coefficient used in this 
weighting and addition may be selected in such a 
way as to minimize the square error with the high- 
resolution Input signal and may be sent as side 
information, or may be determined in accordance 35 
with the magnitude of the high-resolution motion 
vector. In addition, the weighting coefficient in use 
may be different for each field. In this case, since 
the high-resolution motion vector information can 
be obtained on the decoder side, the motion vector 40 
in use may be computed based on the high-resolu- 
tion motion vector compensated in accordance with 
the field time. 

Further, if the up-sampling circuit having the 
vertical up-sampling circuit portion shown In Fig. 20 45 
is used, the difference between the motion vectors 
used in coding a high-resolution picture may be 
obtained and sent as the motion vector information. 

As described above, this invention will provide 
a motion picture coding apparatus, which has a 50 
higher coding efficiency than the prior art, has a 
scalability to ensure simultaneous coding of a plu- 
rality of picture signals of different resolutions, sub- 
jected to interlace scanning, like an HDTV signal 
and the current TV signal, and which performs 65 
improved up-sampling at the time of performing 
prediction for the coding of an interlace-scanned 
high-resolution picture signal, obtained by up-sam- 
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pling an interlace-scanned low-resolution local de- 
coded signal, thus Improving the prediction effi- 
ciency. 

Claims 

1. A motion picture coding apparatus comprising: 

first coding means for prediction-coding a 
high-resolution interlaced picture as a frame 
picture; 

first local decoding means for decoding a 
coding result from said first coding means to 
obtain a high-resolution local decoded signal; 

first predicting means for attaining a high- 
resolution predictive signal from said high-res- 
olution local decoded signal; 

converting means for converting said high- 
resolution interlaced picture to a low-resolution 
non-interlaced picture; 

second coding means for coding said low- 
resolution non-interlaced picture signal; 

second local decoding means for decoding 
a coding result from said second coding 
means to obtain a low-resolution local decoded 
signal; 

up-sampling means for up-sampling said 
low-resolution local decoded signal; 

second predicting means for attaining a 
low-resolution predictive signal from a signal 
up-sampled by said up-sampling means; and 

selecting means for selecting, for each of 
even and odd fields, an optimal one of said 
low-resolution predictive signal and said high- 
resolution predictive signal, which are used 
when performing predictive coding as said 
frame picture, 

whereby a predictive signal is produced 
from said low-resolution predictive signal and 
said high-resolution predictive signal. 

2. The motion picture coding apparatus according 
to claim 1. wherein said second predicting 
means supplies a signal obtained by shifting 
said signal up-sampled by said up-sampling 
means by an amount corresponding to motion 
within a field period, to candidates for said low- 
resolution predictive signal. 

3. A motion picture coding apparatus comprising: 

first coding means for performing predic- 
tive coding on a high-resolution picture; 

first local decoding means for decoding a 
coding result from said first coding means to 
obtain a high-resolution local decoded signal; 

first predicting means for attaining a high- 
resolution predictive signal from said high-res- 
olution local decoded signal; 

converting means for converting said high- 
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resolution picture to a low-resolution picture; 

second coding nneans for coding said low- 
resolution picture signal; 

second local decoding means for decoding 
a coding result fronn said second coding 6 
means to obtain a low-resolution local decoded 
signal; 

up-sampling means for up-sampling said 
low- resolution local decoded signal; and 

second predicting means for attaining a io 
low-resolution predictive signal from a signal 
up-sampled by said up-sampling means, 

whereby as a prediction mode used when 
prediction-coding said high-resolution picture, 
a predictive signal is attained by a weighted 75 
sum of said low-resolution predictive signal 
and said high-resolution predictive signal. 

The motion picture coding apparatus according 
to claim 3, wherein an amount of shift cor- 20 
responding to motion in said field period is 
obtained by dividing a shift amount necessary 
to prepare said high-resolution predictive sig- 
nal by an inter-field distance, and a signal 
produced from a low-resolution predictive sig- 25 
nal obtained by said amount of shift and said 
high-resolution predictive signal is added to 
candidates for said predictive, signal. 

The motion picture coding apparatus according 30 
to claim 3, wherein an amount of shift cor- 
responding to motion in said field period is 
obtained as an optimal shift amount to op- 
timize an evaluation function attained by 
searching around a reference shift amount, ob- 35 
tained by dividing a shift amount necessary to 
prepare said high-resolution predictive signal 
by an inter-field distance, and a difference 
between said optimal shift amount and said 
reference shift amount is transmitted. 40 

The motion picture coding apparatus according 
to claim 3 or 5, wherein said first predicting 
means has means for outputting, as said high- 
resolution predictive signal, a signal obtained 45 
by adding signals, undergone conversion hav- 
ing a mutually complementary characteristic 
with respect to both of said high-resolution 
local decoded signal and said low-resolution 
local decoded signal. so 

A motion picture coding apparatus comprising: 
first coding means for performing predictive 
coding on a high-resolution picture signal; 

first local decoding means for decoding a 55 
coding result from said first coding means to 
obtain a high-resolution local decoded signal; 

first predicting means for attaining a high- 



resolution predictive signal from said high-res- 
olution local decoded signal; 

converting means for converting said high- 
resolution picture signal to a low-resolution pic- 
ture signal; 

second coding means for coding said low- 
resolution picture signal; 

second local decoding means for decoding 
a coding result from said second coding 
means to obtain a low-resolution local decoded 
signal; 

up-sampling means for up-sampling said 
low-resolution local decoded signal; 

second predicting means for attaining a 
low-resolution predictive signal from a signal 
up-sampled by said up-sampling means; and 

prediction-signal selecting means for se- 
lecting a predictive signal used by said first 
coding means from said high-resolution predic- 
tive signal and said low-resolution predictive 
signal under a determination criterion where 
prediction error power in cases when said 
high-resolution predictive signal and said low- 
resolution predictive signal are used in predic- 
tive coding in said first coding means, and an 
amount of additional information generated by 
said first coding means are considered. 

8. A motion picture coding apparatus comprising: 

first coding means having a mode for per- 
forming predictive coding on a high-resolution 
prediction error signal, which is a difference 
between a high-resolution picture signal and a 
high-resolution predictive signal, using a low- 
resolution prediction error signal; 

first local decoding means for decoding a 
coding result from said first coding means to 
obtain a high-resolution local decoded signal; 

first predicting means for detecting a mo- 
tion vector between said high-resolution local 
decoded signal and said high-resolution picture 
signal, and attaining said high-resolution pre- 
dictive signal undergone motion compensation 
using said motion vector; 

converting means for converting said high- 
resolution picture signal to a low-resolution pic- 
ture signal; 

second coding means for coding said low- 
resolution picture signal; 

second local decoding means for decoding 
a coding result from said second coding 
means to obtain a low-resolution local decoded 
signal; 

up-sampling means for up-sampling said 
low-resolution local decoded signal; and 

second predicting means for attaining a 
low-resolution predictive signal, undergone mo- 
tion compensation using said motion vector 
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obtained by said first predicting means, from a 
signal up-sampled by said up-sampling means, 
and obtaining a difference between said low- 
resolution predictive signal and said low-reso- 
lution local decoded signal to yield said low- 5 
resolution prediction error signal. 

9. A motion picture coding apparatus for perform- 
ing predictive coding on a high-resolution pic- 
ture signal and performing predictive coding of io 
a low-resolution picture signal obtained by con- 
verting said high-resolution picture signal, 
comprising: 

first predicting means for producing a 
high-resolution predictive signal based on a 75 
high-resolution local decoded signal decoded 
from a signal resulting from predictive coding 
of said high-resolution picture signal; 

up-sampling means for up-sampling a low- 
resolution local decoded signal decoded from 20 
a signal resulting from predictive coding of 
said low-resolution picture; 

second predicting means for producing a 
low-resolution predictive signal based on an 
output signal of said up-sampling means; and 25 

third predicting means for producing one 
of said high-resolution predictive signal, said 
low-resolution predictive signal and signals ob- 
tained by weighting and adding said high-reso- 
lution predictive signal and said low-resolution 30 
predictive signal, as a predictive signal used 
when performing predictive coding on said 
high-resolution picture signal, 

said up-sampling means including, 

(a) intra-field interpolation-signal producing 35 
means for producing an intra-field interpola- 
tion signal consisting only of one field of a 
signal of said low-resolution local decoded 
signal, 

(b) intra-frame interpolation-signal producing 40 
means for producing an intra-frame inter- 
polation signal consisting of consecutive two 
fields of signals of said low-resolution local 
decoded signal, and 

(c) selecting means for selecting, as said 45 
output signal of said up-sampling means, 

that one signal among said intra-field inter- 
polation signal, said Intra-frame interpolation 
signal and signals obtained by weighting 
and adding said interpolation signals, which so 
is adaptively determined based on at least 
one of said high-resolution picture signal 
and other information necessary for coding. 

10. A motion picture coding apparatus for perform- 55 
ing predictive coding on a high-resolution pic- 
ture signal and performing predictive coding of 

a low-resolution picture signal obtained by con- 



verting said high-resolution picture signal, 
comprising: 

first predicting means for producing a 
high-resolution predictive signal based on a 
high-resolution local decoded signal decoded 
from a signal resulting from predictive coding 
of said high-resolution picture signal; 

up-sampling means for up-sampling a low- 
resolution local decoded signal decoded from 
a signal resulting from predictive coding of 
said low-resolution picture; 

second predicting means for producing a 
low-resolution predictive signal based on an 
output signal of said up-sampling means; and 

third predicting means for producing one 
of said high-resolution predictive signal, said 
low-resolution predictive signal and signals ob- 
tained by weighting and adding said high-reso- 
lution predictive signal and said low-resolution 
predictive signal, as a predictive signal used 
when performing predictive coding on said 
high-resolution picture signal, 

said up-sampling means including, 

(a) first and second motion-compensation 
up-sampling means for performing motion- 
compensation up-sampling on consecutive 
two fields of said low-resolution local de- 
coded signal, and 

(b) motion-vector detecting means for de- 
tecting a motion vector used in said first 
and second motion-compensation up-sam- 
pling means, referring to said high-resolu- 
tion picture signal. 

11. A motion picture coding apparatus for perform- 
ing predictive coding on a high-resolution pic- 
ture signal and performing predictive coding of 
a low-resolution picture signal obtained by con- 
verting said high-resolution picture signal, 
comprising: 

first predicting means for producing a 
high-resolution predictive signal based on a 
high-resolution local decoded signal decoded 
from a signal resulting from predictive coding 
of said high-resolution picture signal; 

up-sampling means for up-sampling a low- 
resolution local decoded signal decoded from 
a signal resulting from predictive coding of 
said low-resolution picture; 

second predicting means for producing a 
low-resolution predictive signal based on an 
output signal of said up-sampling means; and 

third predicting means for producing one 
of said high-resolution predictive signal, said 
low-resolution predictive signal and signals ob- 
tained by weighting and adding said high-reso- 
lution predictive signal and said low-resolution 
predictive signal, as a predictive signal used 
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when performing predictive coding on said 
high-resolution picture signal, 

said up-sannpling means including, 

(a) first and second motion-compensation 
up-sampling means for performing motion- 5 
compensation up-sampling on consecutive 

two fields of said low-resolution local de- 
coded signal, and 

(b) vector computing means for computing 

a motion vector used in said first and sec- io 
ond motion-compensation up-sampling 
means, based on at least one of a motion 
vector used in predictive coding of said low- 
resolution picture signal and a motion vector 
used in predictive coding of said high-reso- is 
lution picture signal. 

12. A motion picture coding apparatus for perform- 
ing predictive coding on a high-resolution pic- 
ture signal and performing predictive coding of 20 
a low-resolution picture signal obtained by con- 
verting said high-resolution picture signal, 
comprising: 

first predicting means for producing a 
high-resolution predictive signal based on a 25 
high-resolution local decoded signal decoded 
from a signal resulting from predictive coding 
of said high-resolution picture signal; 

up-sampling means for up-sampling a low- 
resolution local decoded signal decoded from 30 
a signal resulting from predictive coding of 
said low-resolution picture; 

second predicting means for producing a 
low-resolution predictive signal based on an 
output signal of said up-sampling means; 35 

third predicting means for producing one 
of said high-resolution predictive signal, said 
low-resolution predictive signal and signals ob- 
tained by weighting and adding said high-reso- 
lution predictive signal and said low-resolution 40 
predictive signal, as a predictive signal used 
when performing predictive coding on said 
high-resolution picture signal; and 

weighting-coefficient determining means 
for determining a weighting coefficient used in 45 
said weighting and adding in said third predict- 
ing means, using at least one of (a) a motion 
vector used in predictive coding of said low- 
resolution picture signal, (b) a motion vector 
used in predictive coding of said high-resolu- so 
tion picture signal, (c) a motion vector used in 
motion-compensation up-sampling when said 
up-sampling means performs said motion-com- 
pensation up-sampling, and (d) an amount of 
shift in a case of predicting a signal obtained 56 
by shifting a picture resulting from up-sampling 
of a low-resolution picture by an amount cor- 
responding to motion within a field period. 
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